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Abstract 

We present a first analysis of the clustering of SDSS galaxies using the distribution function of the 
sum of Fourier phases. This statistical method was recently proposed by one of the authors as a new 
probe of the phase correlations of cosmological density fields. Since the Fourier phases are statistically 
independent of the Fourier amplitudes, the phase statistic plays a complementary role to the conventional 
two-point statistics of galaxy clustering. In particular, we focus on the distribution functions of the phase 
sum over three closed wavevectors as a function of the triangle configuration. We find that the observed 
distribution functions of the phase sum are in good agreement with the lowest-order approximation from 
perturbation theory. For a direct comparison with observations, we construct mock catalogs from TV-body 
simulations taking account of the survey geometry, the redshift distortion, and the discreteness due to 
the limited number of data. Indeed the observed phase correlations for the galaxies in the range of the 
absolute magnitude, —22 < Mr < —18, agree well with those for A-dominated spatially flat cold dark 
matter predictions with erg = 0.9 evolved from the Gaussian initial condition. This agreement implies that 
the galaxy biasing is approximately linear in redshift space. Instead, assuming that the galaxy biasing 
is described by a quadratic deterministic function at A: < 0.03[27r/(/i~^Mpc)], we can constrain the ratio 
of the quadratic biasing parameter, 62, to the linear biasing parameter, 61, from the difference of phase 
correlations between observations and mock predictions. We find that the resulting is well fitted by 

b2/bi = 0.54(±0.06) — 0.62(±0.08)(T8 and is almost insensitive to the cosmology and luminosity in those 
ranges. Indeed, 62/^1 is nearly zero when erg = 0.9. 

Key words: cosmology: large-scale structure of universe — cosmology: observations — methods: 
statistical 



1. Introduction 

The spatial distribution of galaxies. Large Scale 
Structure (LSS), provides fundamental knowledge about 
the formation and evolution of the density structure 
in our universe. A recent measurement of the CMB 
(Cosmological Microwave Background) anisotropy by 
WMAP (Wilkinson Microwave Anisotropy Probe) puts 
a stringent constraint on the primordial density fluctu- 
ations, and favors the A-dominated spatially flat Cold 
Dark Matter (LCDM) model with a Gaussian initial con- 
dition (Spcrgel et al. 2003; Komatsu et al. 2003). One 
of the goals in the analysis of galaxy distribution is to 



test cosmological models in an independent and comple- 
mentary manner to the CMB analysis. Another goal is 
to understand galaxy clustering, with particular empha- 
sis on galaxy biasing, which is a statistical relation of 
the clustering between galaxies and the underlying dark 
matter. Recent wide-field galaxy surveys, such as SDSS 
(Sloan Digital Sky Survey) and 2dFGRS (Two Degree 
Field Galaxy Redshift Survey), indeed enable a detailed 
study of galaxy clustering with unprecedented accuracy. 

More conventional statistics for analyzing the galaxy 
distribution is the two-point correlation function, or the 
power spectrum in Fourier space. Two-point statistics 
have been extensively studied for more than 35 years 
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since the pioneer work by Totsuji and Kihara (1969), and 
have been apphed to various redshift surveys, inchiding 
SDSS (e.g., Tegmark et al. 2004a; Zehavi et al. 2005 for 
SDSS galaxies; Yahata ct al. 2005 for SDSS quasars). 
They, however, cannot fully describe the statistical na- 
ture of the galaxy distribution, because LSS is highly 
non-Gaussian due to the nonlinear gravitational evolu- 
tion and nonlinearity in the galaxy biasing. To charac- 
terize the non-Gaussian properties of the galaxy distri- 
bution requires higher order statistics beyond two-point 
statistics. Motivated by this requirement, various statis- 
tics have been introduced as complementary tools to two- 
point statistics for the analysis of galaxy clustering: higher 
order correlation functions (e.g., Peebles 1980), genus 
statistics (Gott et al. 1986; Hoyle et al. 2002; Hikage et 
al. 2002; Park et al. 2005), Minkowski functionals (Mecke 
et al. 1994; Hikage et al. 2003), minimum spanning trees 
(Barrow et al. 1985), void statistics (White 1979), and so 
on. While these statistics capture different parts of non- 
Gaussian features, they arc not completely independent 
of the two-point statistics. 

The Fourier transform, (5k, of a density fluctuation fleld 
is separately written in terms of the amplitude and the 
phase, 6*1^, as 

<5k= |<5k|cxp(20k). (1) 

The Gaussian fields have a uniform distribution of the 
Fourier phases over < < Stt. Therefore, char- 
acterizing the correlation of phases is expected to be 
a direct means to explore the non-Gaussian features. 
Furthermore, Fourier phases are statistically independent 
information of the power spectrum, which is defined by 
(5k 1^- Nevertheless, finding useful statistics of the Fourier 
phases is not easy, mainly because of the cyclic property of 
the phase. For example, the one-point phase distribution 
turns out to be essentially uniform, even in a strongly 
non-Gaussian field (Suginohara, Suto 1991). For this 
reason, previous studies of the Fourier phase have been 
mainly devoted to the evolution of phase shifts in individ- 
ual modes (Ryden, Gramann 1991; Soda, Suto 1992; Jain, 
Bertschinger 1998), or the phase differences between the 
Fourier modes (Scherrer et al. 1991; Coles, Chiang 2000; 
Chiang 2001; Chiang et al. 2002; Watts, Coles 2003). 

Matsubara (2003) recently proposed, as a new mea- 
sure of phase correlations, the distribution function of 
the "phase sum" , 6'ki + 0-k2 + • • • + ^kjv ■, where the cor- 
responding wavevcctors satisfy ki -I- k2 + • • • -I- kjv ~ 0. 
Although a connection between the higher order statis- 
tics and the phase correlations was suggested earlier 
(Bertschinger 1992; Watts, Coles 2003), he discovered an 
important analytic relation between the distribution func- 
tion of the phase sum and the polyspectra using pertur- 
bation theory. 

A subsequent numerical analysis by Hikage et al. (2004) 
explained the behavior of the phase-sum distribution with 
respect to the density structure in real space. When a 
prominent density peak exists in a given sampling volume, 
the Fourier phases of various modes are synchronized to 
have nearly zero values at the position of peak, and thus 
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the phase sum distributes around zero. On the other hand, 
when several peaks with comparable heights exist, the 
synchronization of phases is diluted, and thus the phase- 
sum distribution becomes almost uniform. The phase-sum 
distribution is sensitive to the non-Gaussian feature of the 
density field, especially the relative strength of the most 
high-density peak to other density peaks. The nonlinear 
gravitational evolution and the nonlinearity in the galaxy 
biasing statistically changes the relative strength of den- 
sity peaks in galaxy distribution, and therefore the phase- 
sum distribution is a useful tool to probe the nonlinear 
effects. 

The present analysis applies, for the first time, the 
above phase statistics to the SDSS galaxy catalogs to 
quantify the non-Gaussianity in galaxy distributions. 
There are three possible sources of the non-Gaussianity in 
the present galaxy density field: primordial density field, 
nonlinear gravitational evolution, and nonlinear galaxy 
biasing. Recently, WMAP showed that the primordial 
density field is well approximated by Gaussian statistics. 
Thus, the primordial non-Gaussianity can be safely ig- 
nored in the following analysis, and the prominent source 
of non-Gaussianity in the galaxy distribution is nonlin- 
ear gravitational evolution. In our analysis, we use the 
phase statistics as a cosmological tool independent of the 
two-point statistics. We compare the phase correlations 
of SDSS galaxies with those of mock samples constructed 
from iV-body simulations based on various cosmological 
models. For a fair comparison with observations, we con- 
sider the observational systematic effects including the 
survey geometry, the redshift distortion and the shot- 
noise due to the sparse sampling. We find that the SDSS 
galaxy data agree very well with the LCDM model pre- 
dictions with (Tg — 0.9, which is consistent with the results 
of WMAP. 

Another possible source for non-Gaussianity, galaxy bi- 
asing, is the uncertainty of the statistical relation between 
galaxies and the underlying mass, which originates from 
complicated processes of galaxy formation. It is known 
that galaxy clustering depends sensitively on various prop- 
erties of galaxies, such as the luminosity, color, morphol- 
ogy, and environment (e.g., Dressier 1980). The two-point 
statistics using SDSS galaxies by Zehavi et al. (2005) and 
Tegmark et al. (2004a) clearly exhibits the luminosity and 
morphology dependence of the galaxy biasing (see also 
Kayo et al. 2004 for the dependence in three-point cor- 
relation functions). If the density fluctuation of galaxies, 
Sg(x), is related to the mass density fluctuation, (5m (x), 
as Sg = 5ii5ni, their Fourier transforms also have the lin- 
ear relationship (5g.k = 6i(5m,k- Decomposing the Fourier 
wavevectors to the parts of amplitudes and phases, the 
amplitudes change proportionally to bi regardless of the 
scale; however, the phases are completely preserved un- 
der linear biasing. Therefore, the agreement of the ob- 
served phase-sum distribution with the simulated predic- 
tions suggests that the galaxy biasing is well approximated 
by the linear relation if a correct cosmological model is 
assumed. For definitencss, we adopt a quadratic deter- 
ministic biasing and put a constraint on the degree of the 
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Table 1. 


Properties of our 


volume-limited samples 












^max 


''^samp 

[(/i-iMpc)3] 




^nican 


r 

^box 

[/i-^Mpc] 


Kange oi k 
[27r/(/i-iMpc)] 


-22.0 
-21.0 
-20.0 
-19.0 


-21.0 
-20.0 
-19.0 
-18.0 


0.067 
0.044 
0.028 
0.018 


0.153 
0.103 
0.067 
0.044 


2.92 X 10' 
9.20 X 10^ 
2.69 X 10^ 
7.46 X 10^ 


34008 
44636 
23099 
8640 


9.50 
5.91 
4.88 
4.42 


800 
560 
360 
240 


0.006-0.11 
0.009-0.16 
0.01-0.25 
0.02-0.37 



* Constructed from the SDSS galaxy catalog 'Large-scale Structure Sample 15' in Northern hemisphere. Listed values 
are the upper and lower limits of the r-band magnitude, jjji^/niaxi the upper and lower limits of the redshift, 
•2^min/max 1 t^e Survey volume, Ksamp, the total number of galaxies, Afg^i, the mean separation of galaxies, imean, the 
box-size for Fourier transform, Lboxi and the scale range of k in Fourier space to be used in measuring the distribution 
function of the phase sum. LCDM model parameters, = 0.3 and Qa = 0.7, are assumed throughout. 



nonlinearity of galaxy biasing in a weakly nonlinear regime 
(> 30/i-iMpc). 

This paper is organized as follows. In section 2 we de- 
scribe the SDSS data sets and our simulation mock cat- 
alogs. Section 3 summarizes the perturbative formula of 
phase-sum distribution by Matsubara (2003) and explains 
the method of measuring phase correlations using the dis- 
tribution function of phase sum. The results of observed 
phase correlations are also presented. Section 4 is focused 
on the bispectrum analysis of p^^^ from comparison be- 
tween observations and their mock catalogs. Finally sec- 
tion 5 is devoted to a summary of our results and further 
discussion. 

2. Volume-Limited Samples for SDSS Galaxies 
and Mock Catalogs 

Our present analysis is based on a subset of the SDSS 
galaxy redshift data, 'Large-scale Structure Sample 15' 
(Blanton et al. 2005). This sample includes the spectro- 
scopic data of 389306 galaxies and covers the sky area of 
4426 square degrees. The angular selection function of the 
survey is written in terms of spherical polygons (Hamilton, 
Tegmark 2004). Details of the SDSS can be found in 
the following literature: York et al. (2000) provide an 
overview of the SDSS. Technical articles providing details 
of the SDSS include descriptions of the photometric cam- 
era (Gunn et al. 1998), photometric analysis (Stoughton 
et al. 2002), the photometric system and photometric cali- 
bration (Fukugita et al. 1996; Hogg et al. 2001; Ivezic et al. 
2004; Smith et al. 2002), the photometric pipeline (Lupton 
et al. 2001), astrometric calibration (Pier et al. 2003), 
selection of the galaxy spectroscopic samples (Eisenstein 
et al. 2001; Strauss et al. 2002), and spectroscopic tiling 
(Blanton et al. 2003a). The details of publicly released 
data are summarized in Stoughton et al. (2002) for Early 
Data Release and Abazajian et al. (2003, 2004, 2005) for 
Data Release One, Two and Three, respectively. 

In our analysis we use observational data in the 
Northern hemisphere, except for the distant area from 
the other observed areas, which is designed to overlap 
with the Spitzcr Space Telescope First Look Survey. We 
also omit the area of three stripes in the Southern hemi- 
sphere, which has a survey geometry not appropriate 



for the analysis of galaxy clustering in three-dimensional 
space. The range of the r-band apparent magnitude 
(mr.min, 'Tij-.max) IS sct to be SL Conservative range of 
(14.5, 17.5) after correction for Galactic reddening using 
the maps of Schlegel, Finkbbeiner, and Davis (1998). We 
construct four volume-limited samples with a unit width 
of the absolute magnitude, which cover the range of the r- 
band absolute magnitude from —22 to —18 (Table 1). The 
redshift range of each volume- limited sample (zmim 2;inax) 
is determined by the following distance modulus relation: 

-^^r,min/max ^r,min/max 

- 5l0g[rmin/max(l + ^min/max)/10 pc] (2) 
-^(^niin/max)? 

where r„iin/max is the comoving distance at a redshift of 
■2^min/max 9'iid K{z) is Si quadratic fitting formula of the 
averaged iiT-correction as a function of z (Park et al. 2005), 

K{z) = 2.3537^2 + 0.5735Z - 0.18437. (3) 

The details of A'-corrcction is described in Blanton et 
al. (2003b). Table 1 summarizes the properties of our 
volume-limited subsamples. 

For a fair comparison with the observation, we construct 
a set of mock simulation data, including observational ef- 
fects of the survey geometry, the number density, and 
the redshift distortion (Hikage et al. 2002, 2003). For 
constructing mock samples, we use P^M TV-body simula- 
tions, provided by Jing and Suto (1998). The simulation 
employs 256'^ particles in a periodic comoving box with 
a length of 300ft-^^Mpc or 600/i~^ Mpc using Gaussian 
initial conditions and a Cold Dark Matter (CDM) trans- 
fer function (Bardeen et al. 1986). We use the z = 
snapshot simulation data (for simplicity we neglect the 
light-cone effect) in various CDM models with the param- 
eters listed in Table 2; the dimensionless matter-density 
parameter, fim, the dimensionless cosmological constant, 
J7a, the shape parameter, P of the CDM transfer function 
(Bardeen et al. 1986), and the r.m.s. density- fluctuation 
amplitude smoothed by a top-hat filter with a scale of 
8/i~^Mpc, (Tg. We extract about 10 realizations of wedge 
samples for each volume-limited sample out of the full 
simulation cube so that they have the same sample-shape 
and number of particles (averaged over mock samples) as 
each volume-limited sample. To construct mock samples 
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that extend beyond the simulation box size, we duplicate 
particles using the periodic boundary conditions. We cre- 
ate mock data in rcdshift space by adding the line-of-sight 
component of the peculiar-velocity to the distance of each 
particle for calculating redshift. The mock sample simply 
assumes that each mass particle represents a simulated 
galaxy, and neglects the effect of galaxy biasing, which 
are further discussed in section 4. 



Table 2. Simulation model parameters 



Model 






r 




LCDM 


0.3 


0.7 


0.21 


1, 0.9, 0.7 


SCDM 


1 





0.5 


0.6 


OCDM 


0.3 





0.25 


1 



3. The Distribution Function of Phase Sum 

Matsubara (2003) derived an analytical relation be- 
tween the polyspectra and the distribution of the phase 
sum in perturbation theory. In the lowest-order approx- 
imation, the probability density function (PDF) of the 
phase sum, 0ki + — ^ki-i-k2j over closed wavevectors 
ki,k2, and — ki — k2 reduces to (Matsubara 2003): 

7'(0ki + ^ 6'ki+k2 iKamp) 
3 /2 

OC 1 + ^p'^'(kl,k2|V;amp)cOs(6lki +6lk, - 6'ki+k2),(4) 

where V^amp is the sampling volume and p*-^-* is defined by 
the bispectrum, B(ki,k2), and the power spectrum, P{k), 
as follows: 

p(^)(ki,k2|T/sa.np)= . ^^^''^'^ .(5) 

^ ' V^.ampP(fcl)P(fc2)P(|ki+k2|) ^ ^ 

In the lowest-order approximation, the PDF of the phase 
sum over three wavevectors is fully determined by the 
parameter p^^\ If the hierarchical clustering ansatz is 
valid, B and thereby p'-'^^ is approximately given 

by \/ P/Vse,m-p- Therefore, the lowest-order approxima- 
tion [equation (4)] breaks down at a scale k satisfying 
the condition P(fc)/Vsamp > 1- Note that this is differ- 
ent from the conventional condition of the gravitational 
nonlinear clustering. k^P{k) > 1. In fact, a subsequent 
numerical analysis confirmed the validity of the perturba- 
tive formula, even in a fairly nonlinear regime of clustering 
(Hikage et al. 2004). 

The dependence of the phase-sum distribution on the 
sampling volume can be naturally explained as follows: 
the synchronization of phases becomes weak when multi- 
ple density peaks with comparable heights exist (section 
1). Because the sampling volume is larger, the number 
of peaks increases, and thus the phase-sum distribution 
approaches being uniform. 

When Kiamp is sufficiently large, the higher order terms 
above p^^^ become negligible and then equation (4) is 
applicable, even in nonlinear regimes, in the sense that 



k^P{k) > 1. On the contrary, decreasing Ksamp by divid- 
ing the original survey volume into several sub-volumes 
would provide information about the higher-order spec- 
trum. Dividing the original sample, however, leads to 
decreasing the number of objects per sub-sample. Thus, 
a significant fraction of the available Fourier modes suf- 
fers from the contamination of shot noise. In the present 
galaxy sample, it is quite difficult to extract higher order 
information beyond p^^^ from the distribution function of 
the three-point phase sum. In the present work, therefore, 
we do not divide the survey volume of each volume-limited 
sample. 

We compute the distribution function of the phase sum 
for density contrast fields from mock samples and SDSS 
galaxies. We use the cloud-in-ccU interpolation to assign 
galaxies or dark matter particles to mesh densities. Then, 
the density contrast field is Fourier-transformed on cubic 
grids with a mesh number A^mesh = 256 per side. We cal- 
culate the sum of the Fourier phases for the closed set 
of three wavevectors, and then compute the distribution 
function of the phase sum within each binning range of tri- 
angle configurations. For simplicity, we focus on the con- 
figuration of wavevectors to be a nearly isosceles triangle, 
where the absolute values of two wavevectors, ki and fc2, 
are within the same binning range of scale k. The scale k is 
divided into 8 bins with equal width in logarithmic scale in 
the range of a < /c/ZcNyq < b. The Nyquist frequency /cNyq 
is equal to 27r x (iVmcsii/2)/ibox, where Lbox is a side of a 
cubic box for a Fourier transformation. Table 1 lists Lbox 
for each volume-limited sample, which is chosen for the 
sample volume to be barely covered. We limit the range 
of scale to be a = 0.04 and h = 0.7 so that the observational 
systematics due to the complicated survey geometry and 
the discreteness effect may be neglected compared to the 
sample variance; at smaller fc, the Poisson error increases 
due to the limited number of independent combinations of 
modes forming closed triangles, and furthermore the con- 
volution effect with the survey mask is more serious. At 
larger k, the shot noise becomes more serious. Table 1 lists 
the range of scale k for each volume-limited sample. The 
unit of scale k in Fourier space is set to be 27r/(/i~^Mpc), 
and thereby the corresponding scale-length in real space 
is just the inverse of k in units of /i~^Mpc. The angle tp 
between ki and k2, i.e., (p = arccos[(ki/A:i) • (k2/fc2)] with 
0° < < 180°, is binned to have an equal width of 20°. 

Figure 1 plots the distribution function of the phase sum 
for each volume- limited sample in symbols. For compari- 
son, we plot in lines the lowest-order approximation [equa- 
tion (4)] using p^'^^ computed from the bispectrum and the 
power spectrum (5). In the left panels, the triangle config- 
urations are nearly equilateral shape (110° <Lp< 130°) for 
different k. In the right panels, the triangle configurations 
are nearly isosceles for different angles, ip. The phase-sum 
distributions are found to be well approximated by the 
lowest-order approximation for all of the volume-limited 
samples, regardless of the triangle configurations. The 
dependence of p*-"^^ on Kiamp and k can be understood 
from the nature of perturbative parameter, p^^\ which 
is roughly proportional to [i-'(fc)/ysamp]^^^ under the hi- 
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Fig. 1. Comparison of the observed distribution function of the phase sum (symbols) with the lowest-order approximation [equation 
(4)] where p(^) is computed by the combination of the bispectrum and the power spectrum from observations [equation (5)] (lines). 
Different panels show the results for different volume-limited samples. In left figure, the configurations of triangle wavevectors are 
focused on nearly equilateral triangle (110° <ip < 130°) with different scales of fc ~ fci ~ fc2. In right figure, the triangle configurations 
are nearly isosceles (fc ~ fci ~ k2) with different angles ip between ki and fc2. The unit of scale k is set to be 2tt/ {h~^yipc). 



erarchical clustering ansatz; volume- limited samples for 
fainter galaxies have a smaller T4amp (increase p^^^ ), but a 
smaller P{k) (decrease p*-^-* ) because the measurable range 
of scale k is shifted to larger values due to the smaller 
Vbox- The redshift distortion due to the random motion 
of galaxies also smears the power at small scales, and thus 
p^^^ reaches up to 0.3 at a maximum, much smaller than 
unity. Figure 1 shows that p*-^-* calculated from the am- 
plitude of the phase-sum distribution using equation (4) 
is nearly equal to that from the combination of the bis- 
pectrum and the power spectrum [equation (5)]. In what 
follows, we will use equation (5) to compute p'^^ instead 
of fitting the PDF to equation (4). 

4. Bispectrum Analysis of p^^^ 

Figures 2 and 3 show p^^^ for SDSS galaxies in differ- 
ent volume-limited samples (plotted in symbols) against 
the wavenumber k (Figure 2) and the angle ip (Figure 3). 
For comparison, the LCDM predictions with as = 0.9 are 
plotted in lines. Clearly, the scale-dependence of p^^^ is 
well approximated by a power-law of k. This can be un- 
derstood again from equation (5); because B{ki,k2) and 
P{k) may be approximated as power-laws at those scales, 
we expect p'^' cx P{k) oc k^^. The right panels show the 
ratio of the observed p'^' to the corresponding mock esti- 
mations in the left panels. The overall agreement between 
observations and LCDM predictions with erg = 0.9 is very 
good. 

Figures 4 and 5 show comparisons of the observed p*-^-* 
with various mock predictions to examine the cosmology 
dependence. Figure 4 focuses on the scale dependence. 



while Figure 5 focuses on the shape dependence (the an- 
gle ip of the triangles). The right panels show the ratio 
of p^^' between observations and the corresponding mock 
samples in the left panels. Again, LCDM predictions with 
(78 = 0.9 show the best agreement with the observations 
among our mock samples; the SCDM model predicts a 
smaller p^^^ than the other models over the whole range 
of scales, mainly because of the smaller value of erg (see 
Table 2). The OCDM model has the same value of as 
the LCDM models, but their spectral shape is different: 
the small-scale fluctuation in OCDM is slightly larger due 
to the larger value of F, and thus OCDM predicts higher 
values of phase correlations than LCDM. Figures 6 and 7 
are same as Figures 4 and 5, but for the (Tg dependence us- 
ing mock samples based on the same cosmology of LCDM. 
A systematic increase of p^^^ as erg is clearly found because 
the phase correlations become strong as the gravitational 
evolution proceeds. 

Galaxy biasing is another source of uncertainty in com- 
paring the observations and the mock predictions. If the 
galaxy biasing is linear, i.e., (5g = 6i(5ni, the amplitude 
changes in proportion to the linear coefficient, bi, but the 
phases are invariant. Thus, the consistency of the LCDM 
model remains valid as long as the galaxy biasing is well 
approximated by the linear relation in redshift space (note 
that this does not strictly correspond to the linear biasing 
in real space). The analysis of the bispectrum for 2dF- 
GRS finds that galaxy biasing is consistent to be linear 
under LCDM model in the range of scales from 5/i^^Mpc 
to 30/i-iMpc (Verde et al. 2002). 

Indeed, the combined analysis of three-point correlation 
functions and two-point correlation functions shows a sig- 
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of mock samples based on the representative model of LCDM with eg = 0.9 are plotted by lines. The right figures show 
the ratio of the observed p^"^' and the corresponding mock estimations. The configurations of triangle wavevectors are equi- 
lateral triangles with different scales of k. Error-bars represent the sample variance of the mock samples for each sample. 
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Fig. 6. Same as Figure 4, but for compared mock samples based on LCDM models with erg = 1 (filled cir- 
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nificant deviation from linear biasing at nonlinear scales 
from l/i^^Mpc to 10ft.~^Mpc (Kayo et al. 2004). In con- 
trast, the scales that we probe in the present analysis is 
much larger, i.e., > 30ft.~^Mpc. We therefore parameter- 
ize the galaxy biasing by the local and quadratic relation 
between the density fluctuation of galaxies, (5g, and that 
of mass, (5in, as follows; 



(6) 



where hi and 62 are biasing parameters. In a weakly non- 
linear regime, for example at scales larger than 30/i^^Mpc, 
the locality of the galaxy biasing is valid because the 
scale of galaxy formation is much smaller Mpc). Also 
the density fluctuation is less than unity, and thereby 
higher order terms of the density fluctuation are negli- 
gible. Under the above quadratic biasing, pg for galaxies 
and for mass are related as 



f{Pl,P2,P3) 



P1P2 



hi 

P2P3 + P3P1 



/(Pl,P2,P3), 



(7) 



(8) 



■\/ V'samp Pi P2 P3 

where Pi = P(fci),P2 - ^(^2), and P3 = P(|ki -I- k2|). 
Equation (7) implies that the difference of from pm' 
is proportional to the ratio 62/^1 alone. 

Figure 8 plots 62/^1 calculated from the difference be- 

(3) (3) 

tween pg and pm for each cosmology using equation (7). 
The plotted values of &2/&1 are averaged over scales larger 
than 30/i-^Mpc (fc < 0.03[27r/(/i-iMpc)]). In constraining 
62/61, we do not use the phase sum for the configuration 
of triangle wavevectors with the opening angle, (p < 90°, 
because we find that they strongly suffer from the obser- 
vational systematic due to the survey geometry beyond 
the sample variance. We note that the ratio of the biasing 
parameters in Figure 8 is calculated from the density field 
in redshift space and thus their values are quantitatively 
different in real space. 

The results in Figure 8 suggest that the nonlinearity 
of the galaxy biasing, 62/61, is generally rather small and 
simply dependent on a^, regardless of the cosmology and 
galaxy luminosity. We thereby plot the relation between 
as and 62/61 averaged over all of four volume-limited sam- 
ples in Figure 9. Indeed, we find that as mainly deter- 
mines the resulting 62/61 regardless of cosmology at scales 
> 30/i~^Mpc, and that their relation is well fitted by the 
following linear relation: 

62/61 = (0.54 ± 0.06) - (0.62 ± 0.08)(T8. (9) 

The linear relation is valid if [62/61 1 ^ I and the power 
spectrum of the mass fluctuation is mainly determined by 
the square of as at > 30/i~^Mpc; both pm"* and / (equation 
7) are roughly given by y/ P/Vsamp, which is proportional 
to as in a nearly linear regime. Thus, 62/61 is linearly 
related to as as long as I62/61I is much smaller than unity. 

When we adopt a value of erg ~ 0.9, measured from the 
combined analysis of recent WMAP results and the power 
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spectrum of SDSS galaxies (Spergel et al. 2003; Tegmark 
et al. 2004b), 62/61 = -0.02 ±0.1. Therefore, we conclude 
that the galaxy biasing is well approximated by a linear 
relation. Complementary measurements of (Ts from other 
observations which are independent of the galaxy biasing, 
including weak gravitational lensing and galaxy clusters, 
will put additional constraints on the nonlinearity of the 
galaxy biasing. 

5. Summary and Conclusions 

We have performed the first measurement of the distri- 
bution function of the phase sum of galaxies. We applied 
the phase statistics to volume-limited samples of the lat- 
est SDSS galaxy catalogs. We found that the distribution 
functions of the phase sum for all of the SDSS galaxy 
samples are in good agreement with the lowest-order per- 
turbation formula. Since the latter is characterized by 
p^^^ , we explored various properties of p^^^ . For a quanti- 
tative comparison with observations, we constructed real- 
istic mock catalogs from A^-body simulations. We found 
that the observational phase correlations agree best with 
the LCDM model with o-g = 0.9 if galaxy biasing is linear; 
the SCDM model predicts weaker correlations of phases 
than observations overall, mainly due to the small erg. 
Actually, we found a systematic increase of p^^^ as ag in- 
creases with the same parameters of the LCDM model. 

Assuming that galaxy biasing is expressed by a 
quadratic deterministic relation in a weakly nonlinear 
regime {k < O.G3[27r/(/i~^Mpc)]), we placed constraints on 
a linear combination of the nonlinearity in the galaxy bi- 
asing 62/61 and cTg as 62/61 =0.54(±0.06)-0.62(±0.08)f78. 
Using a recent measurements of cg ^ 0.9 from a com- 
bined analysis of WMAP and SDSS, the galaxy biasing 
was found to be well approximated by a linear relation. 

In the present analysis, we have focused on the behavior 
of p*-'^-' , which carries phase information contained in the 
bispectrum (Scoccimarro et al. 2001; Scoccimarro 2000). 
In this sense, our analysis is equivalent to an analysis of 
the bispectrum, which is also a first attempt using SDSS 
data. Higher order information beyond bispectrum can 
be obtained by measuring the distribution function of the 
phase sum over more than four numbers of wavevectors 
which form closed polygons. Another approach is to de- 
crease K;ainp, bccausc the perturbative parameter, p^^\ 
is inversely proportional to -^/V^amp- However, extract- 
ing higher order information is limited mainly by the shot 
noise, as discussed in section 3. Therefore, deeper and 
denser surveys than SDSS are required to carry out this 
attempt in reality. 
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